Decoupled Vector Architectures: a Rst Look

نویسندگان

  • Roger Espasa
  • Mateo Valero
چکیده

The purpose of this paper is to show that using decoupling techniques in a vector processor, the performance of vector programs can be greatly improved. We will show how, even for an ideal memory system with no latency, decoupling provides a signiicant advantage over standard mode of operation. We will also present data showing that for more realistic latencies, decoupled vector architectures perform substantially better than non-decoupled vector architectures. We will also introduce a bypassing technique between the queues and show how it can reduce the total memory traac. A side eeect of the decoupling technique presented is that it tolerates so well long memory latencies that could make feasible to use very slow DRAM parts in vector computers in order to reduce cost.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Performance of the decoupled ACRI-1 architecture: the perfect club

This paper examines the performance potential of decoupled computer architectures on real-world codes, and includes the rst performance bounds calculations to be published for the highly-decoupled ACRI-1 computer architecture. It also constitutes the rst published work to report on the eeectiveness of a decoupling Fortran90 compiler. Decoupling is an architectural optimisation which ooers very ...

متن کامل

Effective usage of vector registers in decoupled vector architectures

Thz.spaptr presemts a study of the tmpact ofreduclng the vector regtsterstze m a decoupled vector architecture. In traditional in-order vector architectures, loltqvectorr egzstersh avetypically been the norm. We start presenting data that shows that, even for highly ucctorz.~able codes, only a small, fraction ojall elements of a long vector regzster are actually used. Lfre also show that reduct...

متن کامل

Speculative Parallelization in Decoupled Look-ahead Architectures

One well known approach to mitigate the impact of branch mispredictions and cache misses is to enable deep lookahead so as to overlap instruction and data supply with instruction processing. A continuous look-ahead process which uses separate thread of control on another hardware contexts is one such approach which we call decoupled look-ahead [1], [2]. However, in such look-ahead schemes, look...

متن کامل

Memory Decoupled Architectures and related issues Guest Editor’s Introduction

It is my great pleasure to serve as guest editor for this special issue of TCCA Newsletter, which is hosting eight papers from the MEDEA (MEmory DEcoupled Architectures) Workshop, jointly held with PACT-2000 conference. The rationale behind this workshop was to revive the original idea of Memory Access Decoupling, presented in the famous paper of Jim Smith, “Decoupled Access/Execute Architectur...

متن کامل

Decoupled Architectures for Complexity-Effective General Purpose Processors

Decoupled architectures have previously been investigated in the context of high performance scientific computing. For general purpose computing, however, superscalar processors have proven to be flexible in providing high performance across a wide range of applications. To achieve this goal, these architectures have incorporated enormous amounts of complexity to obtain modest performance impro...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1995